Efficiency of Pregroups and the French Noun Phrase
نویسندگان
چکیده
We study mathematical and algorithmic properties of Lambek’s pregroups and illustrate them by the French noun phrase. An algorithm of complexity 3 n to solve the reduction problem in an arbitrary free pregroup as well as recognition by a pregroup grammar is presented. This algorithm is then specified to run in linear time. A sufficient condition for a language fragment that makes the linear algorithm complete is given. Introduction Pregroups are a recent mathematical tool introduced in [Lambek 99] for natural language processing. They arose as a simplification of the Lambek calculus, first published under the name of Syntactic Calculus in [Lambek 58]. A pregroup grammar consists of a free pregroup and a pregroup dictionary, i.e. a finite relation which associates finitely many elements of the pregroup, the so-called types, to each word. [Buszkowski] has shown that pregroup grammars are weakly equivalent to context-free grammars. Well known algorithms like [CYK] or [Earley] for context-free grammars solve the problem of the grammaticality of strings of words in time proportional to the cube of the length of the string. However, it would not only be clumsy and inefficient to translate a pregroup grammar to a context-free grammar (this grammar would include the whole dictionary in its set of rules), but it would also go against the pregroups spirit. Indeed, pregroups were conceived as a tool which “processes the words as you hear them”. For each word you hear, you choose a type from the (mental) dictionary. The grammaticality of a string of words from the dictionary is then decided by performing computations on the corresponding string of types in the pregroup. So far, much of the work with pregroups has been aimed at establishing the universality of the rules, which are the same whatever the language only the dictionary changes. Fragments of Arabic, Japanese, Latin [MathLing], Polish [Kislak], English [Lambek 99, 03], French [Barg Lamb], German [Lamb-Prel NP, SE], Italian [Cas-Lamb] among others have been analysed with the help of pregroups.
منابع مشابه
1 Sylvain
We study mathematical and algorithmic properties of Lambek’s pregroups and illustrate them by analysing the French noun phrase. We establish robustness properties of pregroups and present a simple algorithm of complexity 3 n for deciding reduction in an arbitrary free pregroup as well as a linear algorithm covering a large class of language fragments. In the French noun phrase, the agreement of...
متن کاملInvestigating Embedded Question Reuse in Question Answering
The investigation presented in this paper is a novel method in question answering (QA) that enables a QA system to gain performance through reuse of information in the answer to one question to answer another related question. Our analysis shows that a pair of question in a general open domain QA can have embedding relation through their mentions of noun phrase expressions. We present methods f...
متن کاملDo Heavy-NP Shift Phenomenon and Constituent Ordering in English Cause Sentence Processing Difficulty for EFL Learners?
Heavy-NP shift occurs when speakers prefer placing lengthy or “heavy” noun phrase direct objects in the clause-final position within a sentence rather than in the post-verbal position. Two experiments were conducted in this study, and their results suggested that having a long noun phrase affected the ordering of constituents (the noun phrase and prepositional phrase) by advanced Iranian EFL le...
متن کاملبررسی تأثیر پیچیدگی نحوی ساختار گروه اسمی و فعلی بر وقوع لکنت در کودکان لکنتی پیشدبستانی ۶-۴ سال فارسیزبان
Objective: The purpose of the present research was to investigate the effect of syntactic complexity of noun phrase and verb phrase on the occurrence of stuttering in 4-6 year Persian speaking children with stuttering. Materials & Methods: This descriptive-analytic research was done on 15 stuttering children, consisting of 12 boys and 3 girls, 4 to 6 years old monolingual Persian speaking wh...
متن کاملAn Algorithm for finding Noun Phrase Correspondences in Bilingual Corpora
94304 Abstract The paper describes an algorithm that employs English and French text taggers to associate noun phrases in an aligned bilingual corpus. The tag-gets provide part-of-speech categories which are used by finite-state recognizers to extract simple noun phrases for both languages. Noun phrases are then mapped to each other using an iterative re-estimation algorithm that bears similari...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Journal of Logic, Language and Information
دوره 14 شماره
صفحات -
تاریخ انتشار 2005